Network growth for enhanced natural selection 
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Natural selection and random drift are competing phenomena for explaining the evolution of 
populations. Combining a highly fit mutant with a population structure that improves the odds 
that the mutant spreads through the whole population tips the balance in favor of natural selection. 
The probability that the spread occurs, known as the fixation probability, depends heavily on how 
the population is structured. Certain topologies, albeit highly artificially contrived, have been shown 
to exist that favor fixation. We introduce a randomized mechanism for network growth that is loosely 
inspired in some of these topologies' key properties and demonstrate, through simulations, that it 
is capable of giving rise to structured populations for which the fixation probability significantly 
surpasses that of an unstructured population. This discovery provides important support to the 
notion that natural selection can be enhanced over random drift in naturally occurring population 
structures. 
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Networks of agents that interact with one another un- 
derlie several important phenomena, including the spread 
of epidemics through populations the emergence of 
cooperation in biological and social systems [1, U, 13, the 
dynamics of evolution and various others [3, [1]. 

Typically, the dynamics of such interactions involves the 
propagation of information through the network as the 
agents contend to spread their influence and alter the 
states of other agents. In this letter, we focus on the 
dynamics of evolving populations, particularly on how 
network structure relates to the ability of a mutation to 
take over the entire network by spreading from its node 
of origin. 

In evolutionary dynamics, the probability that a muta- 
tion occurring at one of a population's individuals even- 
tually spreads through the entire population is known as 
the mutation's fixation probability, p. In an otherwise 
homogeneous population, the value of p depends on the 
ratio r of the mutant's fitness to that of the other in- 
dividuals, and it is the interplay between p and r that 
determines the effectiveness of natural selection on the 
evolution of the population, given its size. In essence, 
highly correlated p and r lead to a prominent role of 
natural selection in driving evolution; random drift takes 
primacy, otherwise [9]. 

Let P be a population of n individuals and, for individ- 
ual i, let Pi be any nonempty subset of P that excludes i. 
We consider the evolution of P according to a sequence 
of steps, each of which first selects i E P randomly in 
proportion to i's fitness, then selects j G Pi randomly in 
proportion to some weighting function on Pi , and finally 
replaces j by an offspring of i having the same fitness as 
i. 



When P is a homogeneous population of fitness 1 (ex- 
cept for a randomly chosen mutant, whose fitness is ini- 
tially set to r ^ 1), Pi — P \ {i} and moreover the 
weighting function on every Pi is a constant (thus choos- 
ing j g Pi occurs uniformly at random), this sequence 
of steps is known as the Moran process [a] . In this set- 
ting, evolution can be modeled by a simple discrete-time 
Markov chain, of states 0, 1, . . . , n, in which state s indi- 
cates the existence of s individuals of fitness r, the others 
n — s having fitness 1. 

In this chain, states and n are absorbing and all 
others are transient. If s is a transient state, then it is 
possible either to move from stos-|-lors — 1, with 
probabilities p and q, respectively, such that p/q — r, or 
to remain at state s with probability 1 — p ^ q. When 
r > 1 (an advantageous mutation), the evolution of the 
system has a forward bias; when r < 1 (a disadvanta- 
geous mutation), there is a backward bias. And given 
that the initial state is 1, the probability that the sys- 
tem eventually reaches state n is precisely the fixation 
probability, in this case denoted by pi and given by 



(cf. [^). The probability that the mutation eventually 
becomes extinct (i.e., that the system eventually reaches 
state 0) is 1 — pi. Because pi < 1, extinction is a pos- 
sibility even for advantageous mutations. Similarly, it is 
possible for disadvantageous mutations to spread through 
the entirety of P. 

In order to consider more complex possibilities for Pi, 
we introduce the directed graph D of node set P and edge 
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set containing every ordered pair such that j G Pi- 
The case of a completely connected D (in which every 
node connects out to every other node) corresponds to 
the Moran process. But in the general case, even though 
it continues to make sense to set up a discrete-time 
Markov chain with and n the only absorbing states, 
analysis becomes infeasible nearly always and p must be 
calculated by computer simulation of the evolutionary 
steps. 

The founding work on this graph-theoretic perspective 
for the study of p is [6] , where it is shown that we continue 
to have p = pi for a much wider class of graphs. Specif- 
ically, the necessary and sufficient condition for p = pi 
to hold is that the weighting function be such that, for 
all nodes, the probabilities that result from the incom- 
ing weights sum up to 1 (note that this already holds for 
the outgoing probabilities, thus characterizing a doubly 
stochastic process for out-neighbor selection). In partic- 
ular, if the weighting function is a constant for all nodes 
and a node's in-degree (number of in-neighbors) and out- 
degree (the cardinality of Pi for node i, its number of 
out-neighbors) are equal to each other and the same for 
all nodes, as in the Moran case, then p = pi. 

Other interesting structures, such as scale-free graphs 
[lol |. are also handled in [6|, but the following two ob- 
servations are especially important to the present study. 
The first one is that, if D is not strongly connected (i.e., 
not all nodes are reachable from all others through di- 
rected paths), then p > if and only if all nodes are 
reachable from exactly one of D's strongly connected 
components. Furthermore, when this is the case ran- 
dom drift may be a more important player than natural 
selection, since fixation depends crucially on whether the 
mutation arises in that one strongly connected compo- 
nent. If D is strongly connected, then p > necessarily. 

The second important observation is that there do exist 
structures that suppress random drift in favor of natural 
selection. One of them is the D that in Q is called a 
K-ixnme\ for if > 2 an integer. If n is sufficiently large, 
the value of p for the if-funnel, denoted by px, is 

PK = (2) 

Thus, the if-funnel can be regarded as functionally 
equivalent to the Moran graph with substituting for 
the fitness r. Therefore, the fixation probability can be 
arbitrarily amplified by choosing K appropriately, pro- 
vided r > 1. _ 

Noteworthy additions to the study of [Gj can be found 
in [m, In these works, analytical characterizations 
are obtained for the fixation probability on undirected 
scale-free graphs, both under the dynamics we have de- 
scribed (in which j inherits i's fitness) and the converse 
dynamics (in which it is i that inherits j's fitness). The 
main find is that the fixation probability is, respectively 
for each dynamics, inversely or directly proportional to 
the degree of the node where the advantageous mutation 
appears. 



In this letter, we depart from all previous studies of 
the fixation probability by considering the question of 
whether a mechanism exists for D to be grown from some 
simple initial structure in such a way that, upon reach- 
ing a sufficiently large size, a value of p can be attained 
that substantially surpasses the Moran value pi for an 
advantageous mutation. Such a D might lack the sharp 
amplifying behavior of structures like the .K-funnel, but 
being less artificial might also relate more closely to nat- 
urally occurring processes. We respond affirmatively to 
the question, inspired by the observation discussed above 
on the strong connectedness of D, and using the if-funnel 
as a sieving mechanism to help in looking for promising 
structures. It should be noted, however, that since other 
amplifiers exist with capabilities similar to those of the 
ii'- funnel (e.g., the ii'-superstar jQ]), alternatives to the 
strategy we introduce that are based on them may also 
be possible. 

In a iiT-funnel, nodes are organized into K layers, of 
which layer k contains nodes for some fixed integer 
6 > 2 and fc = 0, 1, . . . , X - 1. It follows that the K- 
funnel has {h^ — l)/(^ ^ 1) nodes. A node in layer k 
connects out to all nodes in layer fc — 1 (modulo K , so 
that an edge exists directed from the single node in layer 
to each of the h^~^ nodes in layer K —1). A if-funnel 
is then, by construction, strongly connected. For a given 
value of n, our strategy for growing D is to make it a 
layered graph like the ii'-funnel, but letting it generalize 
on the if-funnel by allowing each layer to have any size 
(number of nodes), provided no layer remains empty. 

Graph D is the graph that has n nodes in the the 
sequence Dq,Di, ... of directed graphs described next. 
Graph Do has K layers, numbered through K — 1, each 
containing one node. The node in layer k connects out 
to the node in layer fc — 1 (modulo K). For t > an 
integer, Dt+i is obtained from Dt by adding one new 
node, call it i, to a randomly chosen layer, say layer k, 
according to a criterion to be discussed shortly. Node i 
is then connected out to all nodes in layer k — 1 (modulo 
K) and all nodes in layer k+1 (modulo K) are connected 
out to node i. Graph Dt is then strongly connected for 
all t. We note that there are as many possibilities for 
the resulting D as for partitioning n indistinguishable 
objects into K nonempty, distinguishable sets arranged 
circularly, provided we discount for equivalences under 
rotations of the sets. A lower bound on this number 
of possibilities is which for K <^ n is roughly 

Before we describe the rule we use to decide which layer 
is to receive the new node, i, it is important to realize 
that the double stochasticity mentioned earlier implies 
that p = pi for Dq. However, this ceases to hold already 
for Di and may not happen again as the graph gets ex- 
panded. So, whatever the rule is, we are aiming at higher 
p values by giving up on the doubly stochastic character 
of the process whereby fitness propagates through the 
graph. 
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For t > and k any layer of Dt, if we consider the 
layers in the upstream direction from k, we call fc"*" the 
first layer we find whose successor has at most as many 
nodes as itself. In particular, if the successor of layer k 
does not have more nodes than fc, then k^ — k. Now 
let d{k^,k) be the distance from layer fc+ to layer k in 
Dt (i.e., the number of edges on a shortest directed path 
from any node in fc+ to any node in k). 

Layer k is selected to receive node i to yield Dt+i with 
probability 

Pkx[K-d{k+,k)]'' (3) 

for some a > 1. This criterion is loosely suggested by 
the topology of the i^T-funnel. It seeks to privilege first 
the growth of each layer i such that k^ = £ for some 
k, then the growth of the layer k that is immediately 
downstream from £, provided fc+ = £, and so on through 
the other downstream layers. 

In our simulations we use n < I 000 nearly exclusively 
and grow a large number of D samples. The calculation 
of p for a given D involves performing several indepen- 
dent simulations (we use 10 000 in all cases), each one 
starting with the fitness-r mutant substituting for any of 
the n nodes and proceeding as explained earlier until the 
mutation has either spread through all of D's nodes or 
died out (we use constant weighting throughout). The 
fraction of simulations ending in fixation is taken as the 
value of p for that particular D. This calculation can be 
very time-consuming, so we have adopted a mechanism 
to decide whether to proceed with the calculation for a 
given D or to discard it. 

Our mechanism is based on establishing a correlation 
threshold beyond which D is declared sufficiently simi- 
lar to the i^-funnel to merit further investigation. The 
measure of correlation that we use is the Pearson corre- 
lation coefficient between two sequences of the same size, 
which lies in the interval [—1,1] and indicates how closely 
the two sequences are to being linearly correlated (a co- 
efficient of 1 means a direct linear dependence). For se- 
quences X and Y, the coefficient, denoted by C{X, Y), is 
given by C{X,Y) = cov{X,Y)/axcrY, where cov{X,Y) 
is the covariance of X and Y, ax and cry their respective 
standard deviations. 

In our case, X and Y are length- if sequences. If we 
renumber the layers of D so that the layer with the great- 
est number of nodes becomes layer if — 1, the one im- 
mediately downstream from it layer K — 2, and so on 
through layer 0, then we let the sequences X and Y be 
such that Xk = k and Yfc = In , where n}~ is the number 
of nodes in layer k. Notice that, when D is the X-funnel 
itself, then nk = with b > 2, whence Yk = {\nb)Xk 
and C{X,Y) = 1. 

Every D whose sequences X and Y lead C{X, Y) to 
surpass the correlation threshold is as close to having 
rife given by some exponential of k as the threshold al- 
lows. However, the near-linear dependence of the two 
sequences is not enough, since the base of such an ex- 
ponential, which we wish to be as large as possible, can 




Fixation probability 

FIG. 1: (Color online) Simulation results for K = 5. Each 
graph D for which C{X, Y) > 0.9 is represented by its fixation 
probability and by the slope S{X,Y). For each combination 
of n and r, 500 graphs are shown, corresponding roughly to 
12% of the number of graphs that were grown. Dashed lines 
mark pi through pa for r — 1.1, pi for r = 2.0. 



in principle be very small (only slightly above 1), for 
very gently inclined straight lines. On the other hand, a 
steeper straight line indicates a faster reduction of layer 
sizes as we progressively move toward layer from layer 
K — 1 through the other layers. In the analysis that 
follows, then, we also use the slope of the least-squares 
linear approximation of y as a function of X, denoted 
by S{X,Y) and given by S{X,Y) = cov{X, Y)/a%. For 
C{X, Y) close to 1, the base of the aforementioned expo- 
nential approaches e"^'"^^^^. 

Our simulation results are summarized in Fig. [U where 
K = 5,n = 500, 1 000, and r = 1.1, 2.0. For each combi- 
nation and each of four a values (a — 1, 2, 3, 4), a scatter 
plot is given representing each of the graphs generated by 
its fixation probability and the slope S {X, Y) for its two 
sequences, provided C{X,Y) > 0.9. We see that, in all 
cases, strengthening the layer-selection criterion by in- 
creasing a has the effect of moving most of the resulting 
graphs away from the Moran probability (pi) and also 
away from the near-0 slope. 

It is important to notice that, in the absence of the 
slope indicator for each graph, we would be left with a 
possibly wide range of fixation probabilities for the same 
value of a, unable to tell the true likeness of the best 
graphs to the if-funncl without examining their struc- 
tures one by one. In a similar vein, the results shown 
in Fig. [1] emphasize very strongly the role of our par- 
ticular choice of a rule for selecting layers, as opposed 
to merely proceeding uniformly at random. To see this, 
it suffices that we realize that uniformly random choices 
correspond to setting a = in the expression for pk , and 
then we can expect the graphs that pass the correlation 
threshold to be clustered around the points of p ^ pi and 
S{X,Y) - 0. 
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FIG. 2: (Color online) Simulation results for the 3-funnel. 
Dashed lines mark the values of pa. 



results similar to those reported in Fig.[l]can be expected, 
but this remains to be seen. 

In summary, we have demonstrated that strongly con- 
nected layered networks can be grown for which the fixa- 
tion probability significantly surpasses that of the Moran 
process. The growth mechanism we use aggregates one 
new node at a time and chooses the layer to be enlarged 
by the addition of the new node as a function of how far 
layers are from those whose populations are the closest 
upstream local maxima. A great variety of networks can 
result from this process, but we have shown that correlat- 
ing each resulting K-layer network with the i^-funnel ap- 
propriately works as an effective filter to pinpoint those 
of distinguished fixation probability. Further work will 
concentrate on exploring other growth methods and on 
targeting the growth of more general structures. 



We also note a sharp variation in how the fixation prob- 
abilities of the graphs relate to the asymptotic fixation 
probabilities of the /^-funnel as a mutant's fitness is in- 
creased. For r = 1.1, the graphs exhibiting the high- 
est fixation probabilities, and also the highest slopes, are 
such that p is somewhere between p2 and p^. For r = 2.0, 
though, this happens between pi and p2 (= 0.75, not 
shown), therefore providing considerably less amplifica- 
tion. Part of the reason why this happens may be simply 
that the more potent amplifiers are harder to generate by 
our layer-selection mechanism as r is increased. But it 
is also important to realize that, even for the if- funnel, 
achieving a fixation probability near pK requires progres- 
sively larger graphs as r is increased. This is illustrated 
in Fig. [5] for K — 3 and the same two values of r. 

Additional simulation results, for the much larger case 
of if = 10 and n = 10 000, are presented in Fig. [3] for 
r = 1.1 and a = 1,2,3,4. Computationally, this case is 
much more demanding than those of Fig. [U owing mainly 
to the number of distinct networks that can occur, as dis- 
cussed earlier (in fact, for K = 10 and n = 10 000, this 
number is at least of the order of lO'^'^). Consequently, 
many fewer graphs surpassing the 0.9 correlation thresh- 
old were obtained. Even so, one possible reading is that 
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FIG. 3: (Color online) Simulation results for K — 10, n — 
10000, and r = 1.1. Each graph D having C{X,Y) > 0.9 
is represented by its fixation probability and by the slope 
S{X,Y). There are 100 graphs, corresponding roughly to 
0.04% of the graphs that were grown. Dashed lines mark pi 
through p3. 
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